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.ABSTRACT 

A model for the integrated motftng^-' wej fi agea pLOC^i:^i > of 
order one, IMA (1, 1) , having a seasonal (cyclic) component is 
presented. The model incorporates a parameter for possible change in 
level of the process after intervention, following methods developed 
by Box and Tiao (1965), and Glass, Willson, and Gottmann (1972). 
Least-squares estimates and associated significance tests for the 
parameters of the model (in particular, the intervention parameter) 
are derived. The results of a computer study and an example from real 
data are given with analysis and interpretation of parameter 
estimates. Results of the theoretical derivation are extended to 
other models [IMA (1, 1) with deterministic drift, mult i -component 
models], and limitations of the model are noted. (Author) 
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Abstract 



The time-series experimental paradigm is extended to include processes 
which have seasonal variation. A basic linear model is examined which includes 
a seasonal component and intervention effect component. Transformation of 
observations puts the model into general linear form, which is amenable to 
solution by the method of least squares. Estimates for parameters of the model 
are derived, and confidence intervals formed around them. 

-A second method for dealj ng with the seasonal component is discussed. 
— T+TTS-i's based upon seasonal adj"ui>liiicia pi'lut' lu linibi-'oi.i'ics analyDio. — A-tfrtfcT" 
method, that of differencing, is noted. 

Applicability of these approaches in various parametric situations is 
checked by the use of simulations. Computer-generated time-series processes 
with different error variances and amplitudes were analyzed using the methods 
discussed. Results suggest that the first method works best in cases where 
error variance and amplitude are of the same order of magnitude. Seasonal 
adjustment seems better for situations when the amplitude is much 
larger than the error variance. Differencing was a poorer method in all cases. 



Many stochastic processes observed in the behavioral sciences have 
seasonal or cyclic components. Such periodic variation must be included 
in any model which attempts to explain such processes parametrically, A 
special case is the time-series experiment with seasonal component. 

GlasyTT nUotl, and Gottnxin (1072) . havo given a - general approach to 
design and analysis of interrupted time-series experiments. The problem of 
periodic variation was not considered, but its solution follows from the work 
presented there. 

Consider observation of a time-series process which follows a periodic 
upward and downward movement over time distinct from the normal variation 
in each observation. After a number of observations, an intervention is 
made into the process, and additional observations are made. Analysis of the 
data is now required. Certainly the observer is interested in changes in 
the process due to intervention. He might also be interested in estimates of 
the original level, period and amplitude of the seasonal variation, and in any 
changes in the process after intervention. Mathematically, a discrete stochastic 
process may be modelled by an auto-regressive-integrated moving-averages p rocess 
(ARIMA), following Box and Jenkins (1970). Glass et al (1972) note that for most 
processes, a simple model suffices: the integrated-moving averages process of order 
1 or 2, IMA (1,1) or IMA (2,2). The IMA (1,1) is a stationary process (doesn't 
wander around much), in which random shocks or errors change the level of 
the process, and then die out, except for a portion of each previous shock which 
remains permanently. For IMA (2,2) (which is non-stationary and wanders from 
level to level), a portion of each two previous shocks remains. Only IMA (1 J) 



2 



will be considered for the derivation of the model, but extension to IMA (2,2) 
is straightforward. 

Method 1 

At each time t prior to intervention, the realization of an IMA (1,1) 
process with seasonal variation may be represented as a linear sum of tenns, 
following Box and Jenkins (1972): « _ ' 

= L + ^ ^ '^^t ^ ^t 

level of process at the initial observation 
proportion of each shock (error) a^ retained in the series 
amplitude of the seasonal component 
argument of the sine term 
random shock (error) at time i, and 
0 

cov(a.aj) =0^1. 

The sine term is just the first term of a Fourier representation of a 
seasonal variation. It must be noted that such a term is deterministic , and 
as such more useful in fitting data than in forecasting future data (see Box 
and Jenkins, 1970; p. 301). The choice of a sine term rather than cosine 
term is arbitrary and changes only the phase estimate. 

After Intervention an additional term devoting change is needed. In 
the simpl'^st case this will be a change in level, 6. Then, 
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where L 



E(a.) 



' 1 
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= L + Xq ^|]a^. + A sin + 6 + 



(2) 



The argument of the sine term will be as function of ir, generally including 
a phase term: 



when t = time of observation 
K = phase constant 

P = period of the seasonal variation. 
The model is linear in the sense of additivity of components, and is thus 
amenable to solution by the method of least-squares, after transformation. The 
additional error or shock terms, ^|]a^, must be removed first. Box and^Tiao 
( 1965) first showed the form of the transformation, and Kepke (1972) has given 
a generalized method for any ARIMA process. For IMA (1,1), the transformed 
observation is given by 



(I). 




(3) 



z 



Z2 - zi - (Aq - i)y 



z 



t 



t-1 



- (Xq - i)y 



t-1 



(4) 



which may be expressed 



y^ = (1 - Xq)^-^ + (1 - AQ)^-^A«^in 0,^-)+ a^ 



(5) 



prior to intervention, and 



ERIC 



= (1 - \^)^'\ + (1 - XQ)''"'A<(sin io^'»+ (1 - XQ)'''"r'6 + (6) 



t-ni-l 



after intervention. Notice that n-, observations were made prior to intervention, 
and this fact is included in the intervention term. The y^'s are now in the form 
of the general linear model 



"y - Xti + E, where 



X = 



Intervention 



1 

(1-A) 
(1-A)2 



(1-A,)"1 



Sin us 



1 



f(sin wg) 
f(sin u^) 



(1-Aj"r^ f(sin a« ) 
0 



0 
0 



(7) 



(1-A,) 



(l-x,)"l^"2-l f(sin.^^,^^) (l.g"2-l 



e = 



L 
A 

6 



• (8) 



and 

f(sin (0^) = Jg^(l - x^)\in a)^_^._^ + sin w. 
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Note that the amplitude is being estimated and the period is assumed to be known. 
Practically, this will nearly always be true. 
The least-squares estimates 6 are given by 

. i = {VXyh^y. (9) 

No distribution theory for the errors has been mentioned. The assumption - 



of normality, generally reasonable, allows calculation of confidence intervals 
about the estimates L, A, and 6. The standard errors are given by 



3 

where c^^ is the jth diagonal entry of (X'X)"\ aind = — ^ , the 

a ni'»"n2'*^ 

estimated error variance of the a^. The 100(1 - a)% confidence interval is 



The derivation was made on the assumption of known parameter a^. In 
general this will not be met, and an iterative procedure must be used to 
estimate x^. Solutions of (9) may be made for values of (0)( .01){2.0) 
economically by computer, and a minimum variance criterion used to find the 
best solution, with an estimate of x^. Box and Tiao (1965) also show a 
Bayesian solution in which the posterior density function is maximized for 
x^ (see Glass et al 1972). 

The model for IMA (2,2) is slightly more complicated. Here, the process 
may wander from level to level, and it is thought that on theoretical grounds 
solution will be much less satisfactory since a seasonal term may well be con- 
founded with random changes in level. It is mentioned for completeness. The 
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are given by 
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= L + iil^i ' H \i j^l^j ' ^ '"t ' \ 



(11) 



Intervention will produce a 6 term. 

The transformation is slightly more complicated and was derived by Glass 
(1972). The transformed observation y^ are given by 



^1 = ^1 

yg = Zg - 2z^ - (a^ + A2 - 2)y^ 

and again, least-squares estimates may be calculated. Iterations over both 

and produce a two-dimensional plane of variances, and contour lines of 
equal variance may be used to estimate a^ and A2 (see Glass, 1972 , and 
Box and Jenkins, 1970, p. 212-213.) 

Method II 



Another approach to the problem of seasonality is to estimate the amplitude 
and period prior to least-squares analysis and remove the seasonal term, 
leaving a residual which maiy be analyzed as an IMA (1,1) process. Thus 



z|. = - A sin 0)^ (13) 



The estimate A maty be made by 



max(z^) - min(z.) 
A = ' — 75 — {or (v^ < t < 

where t is in some restricted range. A can be shown to be unbiased- The new 
observations z|. are now treated as a simple IMA (1,1) process (or IMA (2,2) 
if that is the case), and the z|. y|. by the transformation given in (4). Least- 
squares estimates for L and 6 may then be computed. 



Method III 

Box and Jenkins (1970) suggest the method of differencing to cope with 
seasonality. This in effect ignores the seasonal term by looking at residuals 



where p = period of the seasonal component. 

The residuals are treated as an IMA (1,1) process. The original intent 
of this method lay in getting better estimates in forecasting, and a more complicated 
multiplicative model was built from the pth differences. Such a model is not 
comtemplated here, and the differencing was considered the final step in 
this method. 

Method IV 

Yet another method of analysis is to ignore the seasonal component entirely 
and analyze the process as if it were a non-seasonal IMA (1,1). The seasonal 
component will increase the error variance, and give wider confidence intervals. 
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Comparison of Methods 

The four methods were used to analyze seven simulated processes which were 
identical except that error variance and amplitude were varied • Table 1 
gives a sunmary of the analyses. The processes were all IMA (1,1) processes 
built according to equation (1). The seven processes are shown in Figure 1, 

^ — Inspection of yag^siJts -for Case I reveals method I to have estimated the ^ 

process parameter (x^) and error variance best, although all estimates of L 
and 6 are good. The results for Case 2 again show method I to be best, with 
Methods II and IV somewhat worse. In Case 3 method II shows an error variance 
closer to the actual case and better estimate of 6, although method I gives 
a better estimate of amplitude. In Case 4 no method seems clearly superior 
to another, and all do rather poorly. Method I shows better estimates of 
A and than the others. Case 5 clearly demonstrates the superiority 
of analysis by Method II in the situation of small error variance and large 
sine term, and this is also true in Case 6 and Case 7, although better estimates 
by method I occur in these cases than in Case 5. 

The results suggest this conclusion: Method I is better when a small 
amplitude seasonal component exists with respect to error variance (same order 
of magnitudel; Method II is better when a large seasonal component is pre sent 
(with respect to error variance) . 

It is apparent that analysis is not nearly so precise as is possible 
without the existence of seasonal terms. The periodic component tends to 
confound examination of error variance, and quite probably one must be content with 
results less exact than is possible in other cases. 

Although no example was found which adequately fit the assumptions of the 
IMA (1,1) model, an analysis is presented to illustrate the procedure. Airline 



Table 1 



Comparison of Estimates by Four Methods for 
Seven Simulated Seasonal Processes 



Case 1 



Case 2 



Case 3 



Case 4 



Case 5 



Case 6 



Parameters 


Method 1 


Method II 


Method III 


Method 




(Linear sine 


(Seasonally 


(Di fferenced) 


(IMA (1 




term included) 


Adjusted; 




Undiffen 




Estimates 


Estimates 


Estimates 


Estimate 


L=bO 


49 • 6 


48.7 


C A C 

50.5 


/ C 1 

'^51 


6=20 


TO O 

19.2 


1 A O 

19.2 


1 ^ o 

16.2 


1 A 


x= 7 


65 






0 




1.3 


1.9 


1.1 




A=1 


1.0 


2.5 






t-statistic 










for 6=0 


17.7* 


13.75* 






L-bO 


DO. 1 


48. 7 


CO o 


C>l 1 

54.1 


0-cU 


OO 1 

22. 1 


0 1 ^ 

21.6 




1 A T 

19.7 


A= 7 


, 7 


1 01 (n=2) 






a 2=2 5 


34.0 


46.3 


'^'60 


31 .3 


A=l 


2.6 


6.5 






t-statisf. c 






• 




for 6=0 


3.60* 


2.58* 






1 — CA 

L=bO 


44.5 


il o ^ 

42.3 


43.9 


il O f\ 

43.9 


r — OA 

6=20 


31 .2 


20.1 


25.5 


25.5 


A= 7 

A • / 


Q5 


. 






a2=25 


46.1 


36.2 


51.2 


51 .2 


A=10 


10.2 


8.5 






t-statistic 










for 6=0 


4.52* 


3.71* 






L=50 


42.0 


36.2 


54.9 


45.7 


6=20 


31 .2 


11.3 


14.4 


17.3 






0^ . o 


0 1 . c. 


o J . 1 


A=.7 


.90 


1.03 


1 .21 


1.40 


A=20 


18.3 


25.8 






t-statistic 










for 6=0 


6.94* 


1.76 






1 — CA 

L=bO 


37.2 


43.2 


41 


35.5 


r —OA 

6=20 


18.6 


18.3 


18 


15.4 


X=.7 


1 .65 


.51 


0 


1 .81 


o2=l 


19.0 


3.4 


5 


27.6 


A=20 


7.9 


20.0 






t-statistic 










for 6=0 


5.62* 


10.65* 






L=50 


50.0 


43.1 


54.1 


56.1 


6=20 


36.4 


22.5 


27.0 


34.0 


.\=.7 


1.1 


.88 


.79 


1.45 


c.'=16 


29.7 


18.5 


26.7 


49.9 


A=20 


13.4 


24.0 






t-statistic 










for 5=0 


6.50* 


5.14* 
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Table 1 



Parameters 



Case 7 



Comparison of Estimates by Four Methods for 
Seven Simulated Seasonal Processes 

Method I Method II Method III 
( Seasonally { Di f ferenced) 



.Linear sine 
cerm included) 
Estimates 



Adjusted) 
Estimates 



Estimates 



L=50 


52.0 


51.2 


60 


6=20 


27.6 


20.6 


30 


A=.7 


.8 


.61 


0 


o2=25 


24.6 


22.5 


21 


A=5 


5.2 


5.5 




t-statistic 








for 6=0 


5.55* 


4.50* 





Method IV 
(IMA (1,1) 
Undi fferonced) 
lis ti mate; 

53.0 
27.0 
.47 
19.7 



.995^40 



= 2.70 



*p < .01 
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n 



passengers carried by major airlines between 1949 and 1951 by month vary seasonally. 
No real intervention is thought to have occurred, but intervention is hypothesized 
after Janua**' 1950. Thus, 6 = 0 for the analysis. The graph of the data 
(from Box and Jenkins, 1970), is given in Figure 2, Parameter estimates for 
methods I and II are summarized in Table 2. 

The error variances are comparable in both methods, but the amplitude 
is better estimated in method II. Inspection of the graph suggests the process 
to have amplitude larger than error variance, thus implying better fit will 
be gained from method II. The intervention effect was non-si gnificantly 
different from zero in both cases. 

Analysis of the data by Box and Jenkins (1970) was performed using a 
much more complicated multiplicative difference model. Error variances 
ar^ not reported. 

Summary 

The use of simple trigonometric terms to account for seasonality may 
have utility in analysis of time-series experimental data. Additional 
complications of the linear model by complex components may well obscure 
affects of intervention. Two methods of analysis are recommended. The first, 
analysis with a sine (or cosine) term included in least-square estimation 
works well with moderate seasonality — that is, error variance comparable to 
the amplitude of the sine component. The second method, seasonal adjustment 
prior to analysis, works well with seasonal variation large with respect to 
error variance. 
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Table 2 

Parameter Estimates for Total Airline Passengers 

Carried Between 1949 and 1951 

Method I Method II 

L 111.3 134.2 

6 9.2* 5.4* 

174.0 178 
A 7.9 24.0 

1.45 1.4 
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